|
|
|
| Home | Wayback Machine | Archive-It | Blog | Heritrix |
| Anonymous User (login or join us) |
Internet Archive crawldata from Aaron Swartz Crawl, captured by crawl345.us.archive.org:aaronswartz from Mon Jan 21 08:49:07 PST 2013 to Mon Jan 21 01:18:18 PST 2013.
This item is part of the collection: Away from Keyboard: Aaron H. Swartz
Identifier: AS-20130121084907-crawl345
Contributor: Internet Archive
Crawljob: aaronswartz
Creator: Internet Archive
Date: 2013
Firstfiledate: 20130121085002
Firstfileserial: 02374
Identifier-access: http://www.archive.org/details/AS-20130121084907-crawl345
Lastdate: 20130121011818
Lastfiledate: 20130121091626
Lastfileserial: 02382
Mediatype: web
Numwarcs: 9
Operator: lekash@archive.org
Scandate: 20130121085002
Scanner: crawl345.us.archive.org
Scanningcenter: sanfrancisco
Sizehint: 9185989451
Sponsor: Internet Archive
Publicdate: 2013-01-21 21:46:30
Addeddate: 2013-01-21 21:46:30
Imagecount: 139408
Keywords: crawldata
| Information | Format | Size |
| AS-20130121084907-crawl345_files.xml | Metadata | [file] |
| AS-20130121084907-crawl345_meta.xml | Metadata | 1.4 KB |
| Other Files | Web ARChive GZ | WARC CDX Index | Item CDX Index | Item CDX Meta-Index | Text |
| AS-20130121084907-02374.warc.gz |
978.0 MB
|
921.6 KB
|
|||
| AS-20130121084907-crawl345.cdx.gz |
7.7 MB
|
||||
| AS-20130121084907-crawl345.cdx.idx |
6.5 KB
|
||||
| AS-20130121085148-02375.warc.gz |
953.8 MB
|
1.2 MB
|
|||
| AS-20130121085451-02376.warc.gz |
953.7 MB
|
1.5 MB
|
|||
| AS-20130121085757-02377.warc.gz |
977.1 MB
|
528.9 KB
|
|||
| AS-20130121090056-02378.warc.gz |
955.6 MB
|
939.3 KB
|
|||
| AS-20130121090426-02379.warc.gz |
955.3 MB
|
868.6 KB
|
|||
| AS-20130121090751-02380.warc.gz |
955.9 MB
|
1.2 MB
|
|||
| AS-20130121091217-02381.warc.gz |
957.8 MB
|
529.6 KB
|
|||
| AS-20130121091538-02382.warc.gz |
1.0 GB
|
768.1 KB
|
|||
| MANIFEST.txt |
594.0 B
|